Overview

Dataset statistics

Number of variables: 34
Number of observations: 250306
Missing cells: 1280168
Missing cells (%): 15.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 64.9 MiB
Average record size in memory: 272.0 B
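
The missing-cell percentage follows from the three counts above. A minimal Python sketch of that arithmetic, using only the figures reported in this section (the variable names are illustrative, not part of the report):

```python
# Figures copied from the dataset statistics above.
n_variables = 34
n_observations = 250306
missing_cells = 1280168

# Every observation has one cell per variable.
total_cells = n_variables * n_observations

# Share of all cells that are missing, as a percentage.
missing_pct = 100 * missing_cells / total_cells

print(total_cells)
print(f"{missing_pct:.1f}%")
```

The rounded result agrees with the 15.0% shown in the report.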

Variable types

CAT: 18
NUM: 9
DATE: 3
UNSUPPORTED: 2
BOOL: 2

Reproduction

Analysis started: 2020-05-12 05:25:54.933794
Analysis finished: 2020-05-12 05:26:47.321055
Duration: 52.39 seconds
Version: pandas-profiling v2.8.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml
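
The reported duration can be cross-checked against the two timestamps. A small sketch using only the Python standard library; the variable names are illustrative:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2020-05-12 05:25:54.933794")
finished = datetime.fromisoformat("2020-05-12 05:26:47.321055")

# Elapsed wall-clock time in seconds.
duration_seconds = (finished - started).total_seconds()

print(f"{duration_seconds:.2f} seconds")
```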

Warnings

clean_up_cost has constant value "0.0" Constant
inspector_name has a high cardinality: 173 distinct values High cardinality
violator_name has a high cardinality: 119992 distinct values High cardinality
violation_street_name has a high cardinality: 1791 distinct values High cardinality
mailing_address_str_name has a high cardinality: 37896 distinct values High cardinality
city has a high cardinality: 5184 distinct values High cardinality
state has a high cardinality: 59 distinct values High cardinality
violation_code has a high cardinality: 235 distinct values High cardinality
violation_description has a high cardinality: 258 distinct values High cardinality
state_fee is highly correlated with admin_fee High correlation
admin_fee is highly correlated with state_fee High correlation
judgment_amount is highly correlated with late_fee and 1 other fields High correlation
late_fee is highly correlated with judgment_amount and 1 other fields High correlation
balance_due is highly correlated with late_fee and 1 other fields High correlation
admin_fee is highly correlated with disposition and 2 other fields High correlation
disposition is highly correlated with admin_fee and 1 other fields High correlation
state_fee is highly correlated with disposition and 2 other fields High correlation
compliance_detail is highly correlated with admin_fee and 1 other fields High correlation
violation_zip_code has 250306 (100.0%) missing values Missing
mailing_address_str_number has 3602 (1.4%) missing values Missing
non_us_str_code has 250303 (> 99.9%) missing values Missing
hearing_date has 12491 (5.0%) missing values Missing
payment_date has 209193 (83.6%) missing values Missing
collection_status has 213409 (85.3%) missing values Missing
grafitti_status has 250305 (> 99.9%) missing values Missing
compliance has 90426 (36.1%) missing values Missing
violation_street_number is highly skewed (γ1 = 377.3438821) Skewed
mailing_address_str_number is highly skewed (γ1 = 37.93517109) Skewed
discount_amount is highly skewed (γ1 = 76.0604994) Skewed
ticket_id has unique values Unique
violation_zip_code is an unsupported type, check if it needs cleaning or further analysis Unsupported
zip_code is an unsupported type, check if it needs cleaning or further analysis Unsupported
late_fee has 105884 (42.3%) zeros Zeros
discount_amount has 249126 (99.5%) zeros Zeros
judgment_amount has 90621 (36.2%) zeros Zeros
payment_amount has 209193 (83.6%) zeros Zeros
balance_due has 111510 (44.5%) zeros Zeros
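
The γ1 values in the skewness warnings are skewness coefficients. As a rough illustration, the sketch below computes the population (biased) form of γ1: the third central moment divided by the 1.5 power of the second. This form is an assumption for illustration; the report may use a bias-adjusted estimator. The toy data are hypothetical and only show how a single extreme value can dominate the statistic.

```python
def skewness(values):
    """Population skewness: third central moment / (second central moment ** 1.5)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5

# Hypothetical toy data: a symmetric sample and the same sample with one outlier.
symmetric = [1, 2, 3, 4, 5]
with_outlier = [1, 2, 3, 4, 500]

print(skewness(symmetric))
print(skewness(with_outlier))
```

A symmetric sample gives zero skewness, while one large value pushes the coefficient strongly positive.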

Variables

ticket_id
Real number (ℝ≥0)

UNIQUE

Distinct count250306
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean152665.5430992465
Minimum18645
Maximum366178
Zeros0
Zeros (%)0.0%
Memory size1.9 MiB

Quantile statistics

Minimum18645
5-th percentile31442.25
Q186549.25
median152597.5
Q3219888.75
95-th percentile272190.75
Maximum366178
Range347533
Interquartile range (IQR)133339.5

Descriptive statistics

Standard deviation77189.88288
Coefficient of variation (CV)0.5056143077
Kurtosis-1.198430396
Mean152665.5431
Median Absolute Deviation (MAD)66662.5
Skewness-0.01400732945
Sum3.821310143e+10
Variance5958278019
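
Several of the statistics above are derived from the others. The following sketch recomputes the range, IQR, coefficient of variation and variance for ticket_id from the reported values; the variable names are illustrative, and small differences are expected because the inputs are rounded.

```python
# Values copied from the ticket_id statistics above.
minimum, maximum = 18645, 366178
q1, q3 = 86549.25, 219888.75
mean, std = 152665.5431, 77189.88288

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)
cv = std / mean                   # Coefficient of variation (CV)
variance = std ** 2               # Variance

print(value_range, iqr, cv, variance)
```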
Histogram with fixed size bins (bins=10)
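Common values
Value | Count | Frequency (%)
22517 | 1 | < 0.1%
31066 | 1 | < 0.1%
26968 | 1 | < 0.1%
53583 | 1 | < 0.1%
55630 | 1 | < 0.1%
49485 | 1 | < 0.1%
51532 | 1 | < 0.1%
61771 | 1 | < 0.1%
63818 | 1 | < 0.1%
59720 | 1 | < 0.1%
Other values (250296) | 250296 | > 99.9%

Minimum 5 values
Value | Count | Frequency (%)
18645 | 1 | < 0.1%
18646 | 1 | < 0.1%
18648 | 1 | < 0.1%
18649 | 1 | < 0.1%
18650 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
366178 | 1 | < 0.1%
366176 | 1 | < 0.1%
325562 | 1 | < 0.1%
325561 | 1 | < 0.1%
325560 | 1 | < 0.1%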

agency_name
Categorical

Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.9 MiB
Value | Count | Frequency (%)
Buildings, Safety Engineering & Env Department | 157784 | 63.0%
Department of Public Works | 74717 | 29.9%
Health Department | 8903 | 3.6%
Detroit Police Department | 8900 | 3.6%
Neighborhood City Halls | 2 | < 0.1%

Length

Max length46
Median length46
Mean length38.25159205
Min length17

inspector_name
Categorical

HIGH CARDINALITY

Distinct count173
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size1.9 MiB
Value | Count | Frequency (%)
Morris, John | 17926 | 7.2%
Steele, Jonathan | 13237 | 5.3%
Samaan, Neil J | 12733 | 5.1%
O'Neal, Claude | 11591 | 4.6%
Devaney, John | 10769 | 4.3%
Sims, Martinzie | 9976 | 4.0%
Sloane, Bennie J | 9730 | 3.9%
Hayes, Billy J | 9493 | 3.8%
Doetsch, James | 7108 | 2.8%
Zizi, Josue | 6587 | 2.6%
Other values (163) | 141156 | 56.4%

Length

Max length26
Median length14
Mean length14.45237429
Min length10

violator_name
Categorical

HIGH CARDINALITY

Distinct count119992
Unique (%)47.9%
Missing34
Missing (%)< 0.1%
Memory size1.9 MiB
Value | Count | Frequency (%)
INVESTMENT, ACORN | 809 | 0.3%
INVESTMENT CO., ACORN | 425 | 0.2%
BANK, WELLS FARGO | 328 | 0.1%
MILLER, JOHN | 205 | 0.1%
SHIFMAN, ALLEN | 192 | 0.1%
NEW YORK, BANK OF | 184 | 0.1%
COMMISSION, DETROIT HOUSING | 178 | 0.1%
STEHLIK, JERRY | 162 | 0.1%
SNOW, GEORGE | 147 | 0.1%
APARTMENTS, CARLTON | 139 | 0.1%
Other values (119982) | 247503 | 98.9%

Length

Max length79
Median length16
Mean length17.35222887
Min length3

violation_street_number
Real number (ℝ≥0)

SKEWED

Distinct count19175
Unique (%)7.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10649.858437272778
Minimum0.0
Maximum14154108.0
Zeros195
Zeros (%)0.1%
Memory size1.9 MiB

Quantile statistics

Minimum0
5-th percentile806
Q14739
median10244
Q315760
95-th percentile19960
Maximum14154108
Range14154108
Interquartile range (IQR)11021

Descriptive statistics

Standard deviation31887.33142
Coefficient of variation (CV)2.994155425
Kurtosis160011.9662
Mean10649.85844
Median Absolute Deviation (MAD)5513
Skewness377.3438821
Sum2665723466
Variance1016801905
Histogram with fixed size bins (bins=10)
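Common values
Value | Count | Frequency (%)
1509 | 240 | 0.1%
0 | 195 | 0.1%
600 | 182 | 0.1%
19300 | 153 | 0.1%
1401 | 132 | 0.1%
15700 | 123 | < 0.1%
8200 | 115 | < 0.1%
20200 | 112 | < 0.1%
9125 | 109 | < 0.1%
14300 | 107 | < 0.1%
Other values (19165) | 248838 | 99.4%

Minimum 5 values
Value | Count | Frequency (%)
0 | 195 | 0.1%
1 | 28 | < 0.1%
2 | 62 | < 0.1%
3 | 2 | < 0.1%
4 | 4 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
14154108 | 1 | < 0.1%
6114175 | 1 | < 0.1%
1116292 | 2 | < 0.1%
1111111 | 2 | < 0.1%
438444 | 1 | < 0.1%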

violation_street_name
Categorical

HIGH CARDINALITY

Distinct count1791
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.9 MiB
Value | Count | Frequency (%)
SEVEN MILE | 3482 | 1.4%
MCNICHOLS | 3041 | 1.2%
LIVERNOIS | 2482 | 1.0%
GRAND RIVER | 1766 | 0.7%
EVERGREEN | 1749 | 0.7%
FENKELL | 1512 | 0.6%
ASBURY PARK | 1491 | 0.6%
WARREN | 1443 | 0.6%
ARCHDALE | 1373 | 0.5%
EIGHT MILE | 1367 | 0.5%
Other values (1781) | 230600 | 92.1%

Length

Max length18
Median length8
Mean length7.730589758
Min length3

violation_zip_code
Unsupported

MISSING
REJECTED
UNSUPPORTED

Missing250306
Missing (%)100.0%
Memory size1.9 MiB

mailing_address_str_number
Real number (ℝ≥0)

MISSING
SKEWED

Distinct count15826
Unique (%)6.4%
Missing3602
Missing (%)1.4%
Infinite0
Infinite (%)0.0%
Mean9149.787802386665
Minimum1.0
Maximum5111345.0
Zeros0
Zeros (%)0.0%
Memory size1.9 MiB

Quantile statistics

Minimum1
5-th percentile36
Q1544
median2456
Q312927.25
95-th percentile26656
Maximum5111345
Range5111344
Interquartile range (IQR)12383.25

Descriptive statistics

Standard deviation36020.3424
Coefficient of variation (CV)3.93674074
Kurtosis3053.83814
Mean9149.787802
Median Absolute Deviation (MAD)2314
Skewness37.93517109
Sum2257289250
Variance1297465067
Histogram with fixed size bins (bins=10)
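Common values
Value | Count | Frequency (%)
213 | 1934 | 0.8%
1 | 1662 | 0.7%
4 | 1181 | 0.5%
3 | 979 | 0.4%
11 | 677 | 0.3%
3476 | 567 | 0.2%
5 | 530 | 0.2%
131 | 528 | 0.2%
21 | 513 | 0.2%
36 | 471 | 0.2%
Other values (15816) | 237662 | 94.9%
(Missing) | 3602 | 1.4%

Minimum 5 values
Value | Count | Frequency (%)
1 | 1662 | 0.7%
2 | 464 | 0.2%
3 | 979 | 0.4%
4 | 1181 | 0.5%
5 | 530 | 0.2%

Maximum 5 values
Value | Count | Frequency (%)
5111345 | 1 | < 0.1%
3511219 | 1 | < 0.1%
3216632 | 1 | < 0.1%
2515238 | 2 | < 0.1%
2511455 | 3 | < 0.1%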

mailing_address_str_name
Categorical

HIGH CARDINALITY

Distinct count37896
Unique (%)15.1%
Missing4
Missing (%)< 0.1%
Memory size1.9 MiB
Value | Count | Frequency (%)
PO BOX | 8668 | 3.5%
P.O. BOX | 7182 | 2.9%
GRAND RIVER | 1249 | 0.5%
LIVERNOIS | 1205 | 0.5%
W. MCNICHOLS | 990 | 0.4%
GREENFIELD | 762 | 0.3%
GRATIOT | 757 | 0.3%
E. JEFFERSON | 751 | 0.3%
HARPER | 713 | 0.3%
P.O. Box | 707 | 0.3%
Other values (37886) | 227318 | 90.8%

Length

Max length30
Median length8
Mean length9.181346032
Min length1

city
Categorical

HIGH CARDINALITY

Distinct count5184
Unique (%)2.1%
Missing0
Missing (%)0.0%
Memory size1.9 MiB
Value | Count | Frequency (%)
DETROIT | 136936 | 54.7%
SOUTHFIELD | 13436 | 5.4%
Detroit | 10496 | 4.2%
detroit | 4183 | 1.7%
DEARBORN | 3637 | 1.5%
FARMINGTON HILLS | 2329 | 0.9%
OAK PARK | 2216 | 0.9%
WARREN | 2000 | 0.8%
DET | 1657 | 0.7%
W. BLOOMFIELD | 1635 | 0.7%
Other values (5174) | 71781 | 28.7%

Length

Max length39
Median length7
Mean length7.982649237
Min length1
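
Part of the high cardinality of city comes from spelling variants of the same name, such as DETROIT, Detroit and detroit in the frequency table. The sketch below shows one possible normalization (trim whitespace, upper-case) applied to a few of the reported counts. It is an illustration only, not a step the report performs, and it would not merge abbreviations such as DET.

```python
from collections import Counter

# A few (value, count) pairs copied from the city frequency table.
city_counts = {
    "DETROIT": 136936,
    "SOUTHFIELD": 13436,
    "Detroit": 10496,
    "detroit": 4183,
    "DEARBORN": 3637,
}

# Merge counts of values that differ only in case or surrounding whitespace.
normalized = Counter()
for name, count in city_counts.items():
    normalized[name.strip().upper()] += count

n_observations = 250306
detroit_share = 100 * normalized["DETROIT"] / n_observations

print(normalized["DETROIT"])
print(f"{detroit_share:.1f}%")
```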

state
Categorical

HIGH CARDINALITY

Distinct count59
Unique (%)< 0.1%
Missing93
Missing (%)< 0.1%
Memory size1.9 MiB
Value | Count | Frequency (%)
MI | 228601 | 91.3%
CA | 5020 | 2.0%
TX | 2420 | 1.0%
FL | 2237 | 0.9%
IL | 1310 | 0.5%
SC | 1304 | 0.5%
OH | 967 | 0.4%
NY | 673 | 0.3%
MN | 632 | 0.3%
GA | 535 | 0.2%
Other values (49) | 6514 | 2.6%

Length

Max length3
Median length2
Mean length2.000371545
Min length2

zip_code
Unsupported

REJECTED
UNSUPPORTED

Missing1
Missing (%)< 0.1%
Memory size1.9 MiB

non_us_str_code
Categorical

MISSING

Distinct count2
Unique (%)66.7%
Missing250303
Missing (%)> 99.9%
Memory size1.9 MiB
Value | Count | Frequency (%)
ONTARIO, Canada | 2 | < 0.1%
, Australia | 1 | < 0.1%
(Missing) | 250303 | > 99.9%

Length

Max length15
Median length3
Mean length3.000127844
Min length3

country
Categorical

Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.9 MiB
Value | Count | Frequency (%)
USA | 250293 | > 99.9%
Cana | 7 | < 0.1%
Aust | 3 | < 0.1%
Egyp | 2 | < 0.1%
Germ | 1 | < 0.1%

Length

Max length4
Median length3
Mean length3.000051936
Min length3

ticket_issued_date
Date

Distinct count86979
Unique (%)34.7%
Missing0
Missing (%)0.0%
Memory size1.9 MiB
Minimum1938-10-09 15:30:00
Maximum2011-12-31 16:15:00
Histogram

hearing_date
Date

MISSING

Distinct count6222
Unique (%)2.6%
Missing12491
Missing (%)5.0%
Memory size1.9 MiB
Minimum2005-01-27 09:00:00
Maximum2017-01-27 10:30:00
Histogram

violation_code
Categorical

HIGH CARDINALITY

Distinct count235
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size1.9 MiB
Value | Count | Frequency (%)
9-1-36(a) | 99091 | 39.6%
9-1-81(a) | 43471 | 17.4%
22-2-88 | 28720 | 11.5%
9-1-104 | 22536 | 9.0%
22-2-88(b) | 7238 | 2.9%
22-2-45 | 5394 | 2.2%
9-1-43(a) - (Dwellin | 5332 | 2.1%
9-1-105 | 5072 | 2.0%
9-1-110(a) | 4814 | 1.9%
22-2-22 | 3755 | 1.5%
Other values (225) | 24883 | 9.9%

Length

Max length20
Median length9
Mean length8.841989405
Min length6

violation_description
Categorical

HIGH CARDINALITY

Distinct count258
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size1.9 MiB
Value | Count | Frequency (%)
Failure of owner to obtain certificate of compliance | 99091 | 39.6%
Failure to obtain certificate of registration for rental property | 43471 | 17.4%
Failure of owner to keep property, its sidewalks, or adjoining public property free from solid waste | 28719 | 11.5%
Excessive weeds or plant growth one- or two-family dwelling or commercial Building | 22536 | 9.0%
Allowing bulk solid waste to lie or accumulate on or about the premises | 7238 | 2.9%
Violation of time limit for approved containers to remain at curbside - early or late | 5394 | 2.2%
Rodent harborage one-or two-family dwelling or commercial building | 5072 | 2.0%
Inoperable motor vehicle(s) one- or two-family dwelling or commercial building | 4814 | 1.9%
Bulk solid waste deposited more than 24 hours before designated time | 3944 | 1.6%
Failure of owner of one- or two-family dwelling to comply with an emergency or imminent danger order concerining an unsafe or unsanitar | 3613 | 1.4%
Other values (248) | 26414 | 10.6%

Length

Max length241
Median length65
Mean length69.82484639
Min length20

disposition
Categorical

HIGH CORRELATION

Distinct count9
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.9 MiB
Value | Count | Frequency (%)
Responsible by Default | 138340 | 55.3%
Not responsible by Dismissal | 48695 | 19.5%
Not responsible by City Dismissal | 34401 | 13.7%
Responsible by Admission | 13701 | 5.5%
Responsible by Determination | 7644 | 3.1%
Not responsible by Determination | 6639 | 2.7%
PENDING JUDGMENT | 387 | 0.2%
SET-ASIDE (PENDING JUDGMENT) | 304 | 0.1%
Responsible (Fine Waived) by Deter | 195 | 0.1%

Length

Max length34
Median length22
Mean length25.24434492
Min length16

fine_amount
Real number (ℝ≥0)

Distinct count43
Unique (%)< 0.1%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean374.42343540880125
Minimum0.0
Maximum10000.0
Zeros195
Zeros (%)0.1%
Memory size1.9 MiB

Quantile statistics

Minimum0
5-th percentile50
Q1200
median250
Q3250
95-th percentile1000
Maximum10000
Range10000
Interquartile range (IQR)50

Descriptive statistics

Standard deviation707.1958066
Coefficient of variation (CV)1.888759462
Kurtosis58.96312069
Mean374.4234354
Median Absolute Deviation (MAD)0
Skewness6.42730943
Sum93720058
Variance500125.9089
Histogram with fixed size bins (bins=10)
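Common values
Value | Count | Frequency (%)
250 | 141245 | 56.4%
50 | 28378 | 11.3%
100 | 21435 | 8.6%
200 | 18683 | 7.5%
500 | 11244 | 4.5%
1000 | 7934 | 3.2%
300 | 6649 | 2.7%
3500 | 6269 | 2.5%
2500 | 2619 | 1.0%
25 | 2275 | 0.9%
Other values (33) | 3574 | 1.4%

Minimum 5 values
Value | Count | Frequency (%)
0 | 195 | 0.1%
1 | 1 | < 0.1%
20 | 2 | < 0.1%
25 | 2275 | 0.9%
50 | 28378 | 11.3%

Maximum 5 values
Value | Count | Frequency (%)
10000 | 357 | 0.1%
8000 | 1 | < 0.1%
7000 | 13 | < 0.1%
5000 | 285 | 0.1%
3500 | 6269 | 2.5%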

admin_fee
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.9 MiB
Value | Count | Frequency (%)
20 | 159880 | 63.9%
0 | 90426 | 36.1%

Length

Max length4
Median length4
Mean length3.638738184
Min length3

state_fee
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.9 MiB
Value | Count | Frequency (%)
10 | 159880 | 63.9%
0 | 90426 | 36.1%

Length

Max length4
Median length4
Mean length3.638738184
Min length3

late_fee
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count37
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21.494505924748108
Minimum0.0
Maximum1000.0
Zeros105884
Zeros (%)42.3%
Memory size1.9 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median10
Q325
95-th percentile50
Maximum1000
Range1000
Interquartile range (IQR)25

Descriptive statistics

Standard deviation56.46426326
Coefficient of variation (CV)2.626916081
Kurtosis89.31129566
Mean21.49450592
Median Absolute Deviation (MAD)10
Skewness7.785447985
Sum5380203.8
Variance3188.213025
Histogram with fixed size bins (bins=10)
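Common values
Value | Count | Frequency (%)
0 | 105884 | 42.3%
25 | 79621 | 31.8%
5 | 17655 | 7.1%
10 | 12516 | 5.0%
20 | 11354 | 4.5%
50 | 6545 | 2.6%
100 | 4803 | 1.9%
350 | 3785 | 1.5%
30 | 3680 | 1.5%
250 | 1506 | 0.6%
Other values (27) | 2957 | 1.2%

Minimum 5 values
Value | Count | Frequency (%)
0 | 105884 | 42.3%
0.1 | 1 | < 0.1%
2.5 | 1223 | 0.5%
5 | 17655 | 7.1%
9.5 | 2 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
1000 | 195 | 0.1%
800 | 1 | < 0.1%
700 | 13 | < 0.1%
500 | 85 | < 0.1%
350 | 3785 | 1.5%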

discount_amount
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct count13
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.12516679584188953
Minimum0.0
Maximum350.0
Zeros249126
Zeros (%)99.5%
Memory size1.9 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum350
Range350
Interquartile range (IQR)0

Descriptive statistics

Standard deviation3.430177755
Coefficient of variation (CV)27.40485392
Kurtosis7220.032173
Mean0.1251667958
Median Absolute Deviation (MAD)0
Skewness76.0604994
Sum31330
Variance11.76611943
Histogram with fixed size bins (bins=10)
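Common values
Value | Count | Frequency (%)
0 | 249126 | 99.5%
25 | 605 | 0.2%
5 | 167 | 0.1%
10 | 155 | 0.1%
20 | 135 | 0.1%
50 | 43 | < 0.1%
3 | 19 | < 0.1%
30 | 17 | < 0.1%
100 | 16 | < 0.1%
350 | 15 | < 0.1%
Other values (3) | 8 | < 0.1%

Minimum 5 values
Value | Count | Frequency (%)
0 | 249126 | 99.5%
3 | 19 | < 0.1%
5 | 167 | 0.1%
10 | 155 | 0.1%
13 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
350 | 15 | < 0.1%
250 | 6 | < 0.1%
100 | 16 | < 0.1%
50 | 43 | < 0.1%
40 | 1 | < 0.1%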

clean_up_cost
Boolean

CONSTANT
REJECTED

Distinct count1
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.9 MiB
Value | Count | Frequency (%)
0 | 250306 | 100.0%

judgment_amount
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count57
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean268.6853563238596
Minimum0.0
Maximum11030.0
Zeros90621
Zeros (%)36.2%
Memory size1.9 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median140
Q3305
95-th percentile580
Maximum11030
Range11030
Interquartile range (IQR)305

Descriptive statistics

Standard deviation626.9152124
Coefficient of variation (CV)2.333268999
Kurtosis86.07829501
Mean268.6853563
Median Absolute Deviation (MAD)140
Skewness7.605726876
Sum67253556.8
Variance393022.6835
Histogram with fixed size bins (bins=10)
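Common values
Value | Count | Frequency (%)
0 | 90621 | 36.2%
305 | 79621 | 31.8%
85 | 17655 | 7.1%
140 | 12516 | 5.0%
250 | 11355 | 4.5%
280 | 7177 | 2.9%
580 | 6545 | 2.6%
1130 | 4803 | 1.9%
3880 | 3785 | 1.5%
360 | 3680 | 1.5%
Other values (47) | 12548 | 5.0%

Minimum 5 values
Value | Count | Frequency (%)
0 | 90621 | 36.2%
31.1 | 1 | < 0.1%
50 | 1 | < 0.1%
55 | 155 | 0.1%
57.5 | 1223 | 0.5%

Maximum 5 values
Value | Count | Frequency (%)
11030 | 195 | 0.1%
8830 | 1 | < 0.1%
7730 | 13 | < 0.1%
5530 | 85 | < 0.1%
3880 | 3785 | 1.5%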

payment_amount
Real number (ℝ≥0)

ZEROS

Distinct count533
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean48.898986440596715
Minimum0.0
Maximum11075.0
Zeros209193
Zeros (%)83.6%
Memory size1.9 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile280
Maximum11075
Range11075
Interquartile range (IQR)0

Descriptive statistics

Standard deviation222.422425
Coefficient of variation (CV)4.548610128
Kurtosis390.3978971
Mean48.89898644
Median Absolute Deviation (MAD)0
Skewness15.50633584
Sum12239709.7
Variance49471.73513
Histogram with fixed size bins (bins=10)
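Common values
Value | Count | Frequency (%)
0 | 209193 | 83.6%
305 | 7797 | 3.1%
280 | 6415 | 2.6%
255 | 4535 | 1.8%
85 | 2728 | 1.1%
140 | 2176 | 0.9%
130 | 2056 | 0.8%
250 | 1699 | 0.7%
80 | 1635 | 0.7%
120 | 1585 | 0.6%
Other values (523) | 10487 | 4.2%

Minimum 5 values
Value | Count | Frequency (%)
0 | 209193 | 83.6%
0.5 | 1 | < 0.1%
3 | 4 | < 0.1%
5 | 10 | < 0.1%
7 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
11075 | 1 | < 0.1%
11030 | 6 | < 0.1%
10030 | 3 | < 0.1%
7760 | 1 | < 0.1%
7410 | 1 | < 0.1%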

balance_due
Real number (ℝ)

HIGH CORRELATION
ZEROS

Distinct count606
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean222.44905795306545
Minimum-7750.0
Maximum11030.0
Zeros111510
Zeros (%)44.5%
Memory size1.9 MiB

Quantile statistics

Minimum-7750
5-th percentile0
Q10
median25
Q3305
95-th percentile580
Maximum11030
Range18780
Interquartile range (IQR)305

Descriptive statistics

Standard deviation606.3940102
Coefficient of variation (CV)2.725990462
Kurtosis96.10064154
Mean222.449058
Median Absolute Deviation (MAD)25
Skewness8.008018813
Sum55680333.9
Variance367713.6956
Histogram with fixed size bins (bins=10)
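Common values
Value | Count | Frequency (%)
0 | 111510 | 44.5%
305 | 67281 | 26.9%
85 | 14092 | 5.6%
140 | 9324 | 3.7%
250 | 8967 | 3.6%
25 | 7692 | 3.1%
580 | 5673 | 2.3%
1130 | 4151 | 1.7%
360 | 3402 | 1.4%
3880 | 3386 | 1.4%
Other values (596) | 14828 | 5.9%

Minimum 5 values
Value | Count | Frequency (%)
-7750 | 1 | < 0.1%
-4820 | 1 | < 0.1%
-3880 | 1 | < 0.1%
-3795 | 1 | < 0.1%
-3580 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
11030 | 187 | 0.1%
10530 | 1 | < 0.1%
8830 | 1 | < 0.1%
7730 | 13 | < 0.1%
5530 | 78 | < 0.1%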

payment_date
Date

MISSING

Distinct count2307
Unique (%)5.6%
Missing209193
Missing (%)83.6%
Memory size1.9 MiB
Minimum2005-01-25 00:00:00
Maximum2017-01-25 00:00:00
Histogram

payment_status
Categorical

Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.9 MiB
Value | Count | Frequency (%)
NO PAYMENT APPLIED | 209193 | 83.6%
PAID IN FULL | 31931 | 12.8%
PARTIAL PAYMENT APPLIED | 9182 | 3.7%

Length

Max length23
Median length18
Mean length17.41800836
Min length12

collection_status
Categorical

MISSING

Distinct count1
Unique (%)< 0.1%
Missing213409
Missing (%)85.3%
Memory size1.9 MiB
Value | Count | Frequency (%)
IN COLLECTION | 36897 | 14.7%
(Missing) | 213409 | 85.3%

Length

Max length13
Median length3
Mean length4.474075731
Min length3

grafitti_status
Categorical

MISSING

Distinct count1
Unique (%)100.0%
Missing250305
Missing (%)> 99.9%
Memory size1.9 MiB
Value | Count | Frequency (%)
GRAFFITI TICKET | 1 | < 0.1%
(Missing) | 250305 | > 99.9%

Length

Max length15
Median length3
Mean length3.000047941
Min length3

compliance_detail
Categorical

HIGH CORRELATION

Distinct count10
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.9 MiB
Value | Count | Frequency (%)
non-compliant by no payment | 129267 | 51.6%
not responsible by disposition | 89735 | 35.9%
non-compliant by late payment more than 1 month | 19016 | 7.6%
compliant by late payment within 1 month | 6300 | 2.5%
compliant by on-time payment | 3880 | 1.6%
compliant by early payment | 992 | 0.4%
not responsible by pending judgment disposition | 691 | 0.3%
compliant by no fine | 195 | 0.1%
compliant by payment with no scheduled hearing | 161 | 0.1%
compliant by payment on unknown date | 69 | < 0.1%

Length

Max length47
Median length27
Mean length29.9981223
Min length20

compliance
Boolean

MISSING

Distinct count2
Unique (%)< 0.1%
Missing90426
Missing (%)36.1%
Memory size1.9 MiB
Value | Count | Frequency (%)
0 | 148283 | 59.2%
1 | 11597 | 4.6%
(Missing) | 90426 | 36.1%

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
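
The definition above can be sketched directly in plain Python. This is a minimal illustration, not the routine the report itself uses; the function name and the toy data are hypothetical.

```python
from math import sqrt

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)

# Hypothetical toy data.
x = [1, 2, 3, 4]

# A change of location and scale with positive slope leaves r at +1.
r_linear = pearson_r(x, [3 * v + 5 for v in x])

# Flipping the sign of the slope gives r = -1.
r_negative = pearson_r(x, [-v for v in x])

print(r_linear, r_negative)
```

The sketch assumes neither input is constant, since a zero standard deviation would make the ratio undefined.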

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
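
As a sketch of this definition, the code below converts each variable to ranks (ties receive the average rank) and then applies the covariance-over-standard-deviations formula to the ranks. The helper names and toy data are hypothetical, and the code is an illustration rather than the report's own implementation.

```python
from math import sqrt

def pearson_r(x, y):
    """Covariance divided by the product of standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)

def average_ranks(values):
    """1-based ranks; tied values share the mean of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks

def spearman_rho(x, y):
    """Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))

# Hypothetical toy data: a monotonic but nonlinear relationship.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]

rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(rho, r)
```

For this monotonic but nonlinear example, ρ reaches 1 while Pearson's r stays below 1.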

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
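
The pair-counting description above corresponds to the variant often called tau-a. A minimal sketch follows, with a hypothetical function name and toy data; tied pairs count as neither concordant nor discordant here, and tie-corrected variants used by some libraries will give different values when ties are present.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1      # both variables move in the same direction
        elif direction < 0:
            discordant += 1      # the variables move in opposite directions
    return (concordant - discordant) / len(pairs)

# Hypothetical toy data: one swapped pair out of six.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]

tau = kendall_tau_a(x, y)
print(tau)
```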

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
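
A sketch of a bias-corrected Cramér's V for a contingency table of counts is given below. It follows the correction commonly attributed to Bergsma (2013), in which the chi-squared-based φ² and the table dimensions are each adjusted by terms in 1/(n - 1). The exact form used by the report is an assumption here, and the function name and toy tables are hypothetical.

```python
from math import sqrt

def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as rows of counts."""
    r = len(table)
    k = len(table[0])
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(r)) for j in range(k)]

    # Pearson chi-squared statistic against the independence model.
    chi2 = 0.0
    for i in range(r):
        for j in range(k):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (table[i][j] - expected) ** 2 / expected

    phi2 = chi2 / n
    # Bias corrections for phi squared and for the table dimensions.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))

# Hypothetical toy tables: perfect association and exact independence.
v_perfect = cramers_v_corrected([[30, 0], [0, 20]])
v_independent = cramers_v_corrected([[10, 20], [20, 40]])

print(v_perfect, v_independent)
```

The sketch assumes every row and column total is positive, so that no expected count is zero.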

Missing values

Sample

First rows

index | ticket_id | agency_name | inspector_name | violator_name | violation_street_number | violation_street_name | violation_zip_code | mailing_address_str_number | mailing_address_str_name | city | state | zip_code | non_us_str_code | country | ticket_issued_date | hearing_date | violation_code | violation_description | disposition | fine_amount | admin_fee | state_fee | late_fee | discount_amount | clean_up_cost | judgment_amount | payment_amount | balance_due | payment_date | payment_status | collection_status | grafitti_status | compliance_detail | compliance
0 | 22056 | Buildings, Safety Engineering & Env Department | Sims, Martinzie | INVESTMENT INC., MIDWEST MORTGAGE | 2900.0 | TYLER | NaN | 3.0 | S. WICKER | CHICAGO | IL | 60606 | NaN | USA | 2004-03-16 11:40:00 | 2005-03-21 10:30:00 | 9-1-36(a) | Failure of owner to obtain certificate of compliance | Responsible by Default | 250.0 | 20.0 | 10.0 | 25.0 | 0.0 | 0.0 | 305.0 | 0.0 | 305.0 | NaT | NO PAYMENT APPLIED | NaN | NaN | non-compliant by no payment | 0.0
1 | 27586 | Buildings, Safety Engineering & Env Department | Williams, Darrin | Michigan, Covenant House | 4311.0 | CENTRAL | NaN | 2959.0 | Martin Luther King | Detroit | MI | 48208 | NaN | USA | 2004-04-23 12:30:00 | 2005-05-06 13:30:00 | 61-63.0600 | Failed To Secure Permit For Lawful Use Of Building | Responsible by Determination | 750.0 | 20.0 | 10.0 | 75.0 | 0.0 | 0.0 | 855.0 | 780.0 | 75.0 | 2005-06-02 | PAID IN FULL | NaN | NaN | compliant by late payment within 1 month | 1.0
2 | 22062 | Buildings, Safety Engineering & Env Department | Sims, Martinzie | SANDERS, DERRON | 1449.0 | LONGFELLOW | NaN | 23658.0 | P.O. BOX | DETROIT | MI | 48223 | NaN | USA | 2004-04-26 13:40:00 | 2005-03-29 10:30:00 | 9-1-36(a) | Failure of owner to obtain certificate of compliance | Not responsible by Dismissal | 250.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaT | NO PAYMENT APPLIED | NaN | NaN | not responsible by disposition | NaN
3 | 22084 | Buildings, Safety Engineering & Env Department | Sims, Martinzie | MOROSI, MIKE | 1441.0 | LONGFELLOW | NaN | 5.0 | ST. CLAIR | DETROIT | MI | 48214 | NaN | USA | 2004-04-26 13:30:00 | NaT | 9-1-36(a) | Failure of owner to obtain certificate of compliance | Not responsible by City Dismissal | 250.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaT | NO PAYMENT APPLIED | NaN | NaN | not responsible by disposition | NaN
4 | 22093 | Buildings, Safety Engineering & Env Department | Sims, Martinzie | NATHANIEL, NEAL | 2449.0 | CHURCHILL | NaN | 7449.0 | CHURCHILL | DETROIT | MI | 48206 | NaN | USA | 2004-04-26 13:00:00 | 2005-03-29 10:30:00 | 9-1-36(a) | Failure of owner to obtain certificate of compliance | Not responsible by Dismissal | 250.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaT | NO PAYMENT APPLIED | NaN | NaN | not responsible by disposition | NaN
5 | 22046 | Buildings, Safety Engineering & Env Department | Sims, Martinzie | KASIMU, UKWELI | 6478.0 | NORTHFIELD | NaN | 2755.0 | E. 17TH | LOG BEACH | CA | 908041512 | NaN | USA | 2004-05-01 11:50:00 | 2005-03-21 10:30:00 | 9-1-36(a) | Failure of owner to obtain certificate of compliance | Responsible by Default | 250.0 | 20.0 | 10.0 | 25.0 | 0.0 | 0.0 | 305.0 | 0.0 | 305.0 | NaT | NO PAYMENT APPLIED | NaN | NaN | non-compliant by no payment | 0.0
6 | 18738 | Buildings, Safety Engineering & Env Department | Williams, Darrin | Deerwood Development Group Inc, Deer | 8027.0 | BRENTWOOD | NaN | 476.0 | Garfield | Clinton | MI | 48038 | NaN | USA | 2004-06-14 14:15:00 | 2005-02-22 15:00:00 | 61-63.0500 | Failed To Secure Permit For Lawful Use Of Land | Responsible by Default | 750.0 | 20.0 | 10.0 | 75.0 | 0.0 | 0.0 | 855.0 | 0.0 | 855.0 | NaT | NO PAYMENT APPLIED | NaN | NaN | non-compliant by no payment | 0.0
7 | 18735 | Buildings, Safety Engineering & Env Department | Williams, Darrin | Rafee Auto Services L.L.C., RAF | 8228.0 | MT ELLIOTT | NaN | 8228.0 | Mt. Elliott | Detroit | MI | 48211 | NaN | USA | 2004-06-16 12:30:00 | 2005-02-22 15:00:00 | 61-63.0100 | Noncompliance/Grant Condition/BZA/BSE | Responsible by Default | 100.0 | 20.0 | 10.0 | 10.0 | 0.0 | 0.0 | 140.0 | 0.0 | 140.0 | NaT | NO PAYMENT APPLIED | NaN | NaN | non-compliant by no payment | 0.0
8 | 18733 | Buildings, Safety Engineering & Env Department | Williams, Darrin | Rafee Auto Services L.L.C., RAF | 8228.0 | MT ELLIOTT | NaN | 8228.0 | Mt. Elliott | Detroit | MI | 48211 | NaN | USA | 2004-06-16 12:25:00 | 2005-02-22 15:00:00 | 61-63.0100 | Noncompliance/Grant Condition/BZA/BSE | Responsible by Default | 100.0 | 20.0 | 10.0 | 10.0 | 0.0 | 0.0 | 140.0 | 0.0 | 140.0 | NaT | NO PAYMENT APPLIED | NaN | NaN | non-compliant by no payment | 0.0
9 | 28204 | Buildings, Safety Engineering & Env Department | Williams, Darrin | Inc, Nanno | 15307.0 | SEVEN MILE | NaN | 1537.0 | E. Seven Mile | Detroit | MI | 48205 | NaN | USA | 2004-07-12 13:30:00 | 2005-05-31 13:30:00 | 61-63.0600 | Failed To Secure Permit For Lawful Use Of Building | Responsible by Default | 750.0 | 20.0 | 10.0 | 75.0 | 0.0 | 0.0 | 855.0 | 0.0 | 855.0 | NaT | NO PAYMENT APPLIED | NaN | NaN | non-compliant by no payment | 0.0

Last rows

index | ticket_id | agency_name | inspector_name | violator_name | violation_street_number | violation_street_name | violation_zip_code | mailing_address_str_number | mailing_address_str_name | city | state | zip_code | non_us_str_code | country | ticket_issued_date | hearing_date | violation_code | violation_description | disposition | fine_amount | admin_fee | state_fee | late_fee | discount_amount | clean_up_cost | judgment_amount | payment_amount | balance_due | payment_date | payment_status | collection_status | grafitti_status | compliance_detail | compliance
250296 | 366178 | Buildings, Safety Engineering & Env Department | Bush, Wesley | MARK JACKSON | 8020.0 | PURITAN | NaN | 251.0 | HEYDEN | DETROT | MI | 48219 | NaN | USA | 2006-07-28 14:00:00 | 2016-10-06 09:00:00 | 9-1-111 | Failure of owner to remove graffiti or maintain or restore property free of graffiti. | Not responsible by City Dismissal | 100.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaT | NO PAYMENT APPLIED | NaN | GRAFFITI TICKET | not responsible by disposition | NaN
250297 | 366176 | Buildings, Safety Engineering & Env Department | Bush, Wesley | MARK JACKSON | 8020.0 | PURITAN | NaN | 251.0 | HEYDEN | DETROT | MI | 48219 | NaN | USA | 2006-07-28 14:00:00 | 2016-10-06 09:00:00 | 9-1-36(a) | Failure of owner to obtain certificate of compliance | Not responsible by City Dismissal | 250.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaT | NO PAYMENT APPLIED | NaN | NaN | not responsible by disposition | NaN
250298 | 325560 | Buildings, Safety Engineering & Env Department | Bush, Wesley | WESTGATE TERRACE APARTMENTS LLC | 10701.0 | MEYERS RD | NaN | 1715.0 | MEYERS | DETROIT | MI | 48235 | NaN | USA | 2010-12-02 11:00:00 | 2015-01-06 09:00:00 | 9-1-43(a) - (Structu | Fail to comply with an Emergency or imminent danger order concerining an unsafe or unsanitary structure or unlawful occupancy (all other structures, except buildings with five (5) or more stories) | Not responsible by City Dismissal | 1000.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaT | NO PAYMENT APPLIED | NaN | NaN | not responsible by disposition | NaN
250299 | 325556 | Buildings, Safety Engineering & Env Department | Bush, Wesley | WESTGATE TERRACE APARTMENTS LLC | 10701.0 | MEYERS RD | NaN | 1715.0 | MEYERS | DETROIT | MI | 48235 | NaN | USA | 2010-12-02 11:00:00 | 2015-01-06 09:00:00 | 9-1-43(a) - (Structu | Fail to comply with an Emergency or imminent danger order concerining an unsafe or unsanitary structure or unlawful occupancy (all other structures, except buildings with five (5) or more stories) | Not responsible by City Dismissal | 1000.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaT | NO PAYMENT APPLIED | NaN | NaN | not responsible by disposition | NaN
250300 | 325558 | Buildings, Safety Engineering & Env Department | Bush, Wesley | WESTGATE TERRACE APARTMENTS LLC | 10701.0 | MEYERS RD | NaN | 1715.0 | MEYERS | DETROIT | MI | 48235 | NaN | USA | 2010-12-02 11:00:00 | 2015-01-06 09:00:00 | 9-1-43(a) - (Structu | Fail to comply with an Emergency or imminent danger order concerining an unsafe or unsanitary structure or unlawful occupancy (all other structures, except buildings with five (5) or more stories) | Not responsible by City Dismissal | 1000.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaT | NO PAYMENT APPLIED | NaN | NaN | not responsible by disposition | NaN
250301 | 325555 | Buildings, Safety Engineering & Env Department | Bush, Wesley | WESTGATE TERRACE APARTMENTS LLC | 10701.0 | SANTA MARIA | NaN | 1715.0 | MEYERS | DETROIT | MI | 48235 | NaN | USA | 2010-12-02 11:00:00 | 2015-01-06 09:00:00 | 9-1-43(a) - (Structu | Fail to comply with an Emergency or imminent danger order concerining an unsafe or unsanitary structure or unlawful occupancy (all other structures, except buildings with five (5) or more stories) | Not responsible by City Dismissal | 1000.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaT | NO PAYMENT APPLIED | NaN | NaN | not responsible by disposition | NaN
250302 | 325557 | Buildings, Safety Engineering & Env Department | Bush, Wesley | WESTGATE TERRACE APARTMENTS LLC | 10701.0 | MEYERS RD | NaN | 1715.0 | MEYERS | DETROIT | MI | 48235 | NaN | USA | 2010-12-02 11:00:00 | 2015-01-06 09:00:00 | 9-1-43(a) - (Structu | Fail to comply with an Emergency or imminent danger order concerining an unsafe or unsanitary structure or unlawful occupancy (all other structures, except buildings with five (5) or more stories) | Not responsible by City Dismissal | 1000.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaT | NO PAYMENT APPLIED | NaN | NaN | not responsible by disposition | NaN
250303 | 325562 | Buildings, Safety Engineering & Env Department | Bush, Wesley | WESTGATE TERRACE APARTMENTS LLC | 10701.0 | MEYERS RD | NaN | 1715.0 | MEYERS | DETROIT | MI | 48235 | NaN | USA | 2010-12-02 11:00:00 | 2015-01-06 09:00:00 | 9-1-43(a) - (Structu | Fail to comply with an Emergency or imminent danger order concerining an unsafe or unsanitary structure or unlawful occupancy (all other structures, except buildings with five (5) or more stories) | Not responsible by City Dismissal | 1000.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaT | NO PAYMENT APPLIED | NaN | NaN | not responsible by disposition | NaN
250304 | 325559 | Buildings, Safety Engineering & Env Department | Bush, Wesley | WESTGATE TERRACE APARTMENTS LLC | 10701.0 | MEYERS RD | NaN | 1715.0 | MEYERS | DETROIT | MI | 48235 | NaN | USA | 2010-12-02 11:00:00 | 2015-01-06 09:00:00 | 9-1-43(a) - (Structu | Fail to comply with an Emergency or imminent danger order concerining an unsafe or unsanitary structure or unlawful occupancy (all other structures, except buildings with five (5) or more stories) | Not responsible by City Dismissal | 1000.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaT | NO PAYMENT APPLIED | NaN | NaN | not responsible by disposition | NaN
250305 | 325561 | Buildings, Safety Engineering & Env Department | Bush, Wesley | WESTGATE TERRACE APARTMENTS LLC | 10701.0 | MEYERS RD | NaN | 1715.0 | MEYERS | DETROIT | MI | 48235 | NaN | USA | 2010-12-02 11:00:00 | 2015-01-06 09:00:00 | 9-1-43(a) - (Structu | Fail to comply with an Emergency or imminent danger order concerining an unsafe or unsanitary structure or unlawful occupancy (all other structures, except buildings with five (5) or more stories) | Not responsible by City Dismissal | 1000.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaT | NO PAYMENT APPLIED | NaN | NaN | not responsible by disposition | NaN